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ABSTRACT 

We consider the issue of hemispherical power asymmetry in the three- year WMAP data, adopting a 
previously introduced modulation framework. Computing both frequentist probabilities and Bayesian 
evidences, we find that the model consisiting of an isotropic CMB sky modulated by a dipole field, gives 
a substantially better fit to the observations than the purely isotropic model, even when accounting 
for the larger prior volume. For the ILC map, the Bayesian log-evidence difference is ~ 1.8 in favour 
of the modulated model, and the raw improvement in maximum log-likelihood is 6.1. The best-fit 
modulation dipole axis points toward (l,b) = (225°,— 27°), and the modulation amplitude is 0.114, 
in excellent agreement with the results from the first-year analyses. The frequentist probability of 
obtaining such a high modulation amplitude in an isotropic universe is ~ 1%. These results are not 
sensitive to data set or sky cut. Thus, the statistical evidence for a power asymmetry anomaly is 
both substantial and robust, although not decisive, for the currently available data. Increased sky 
coverage through better foreground handling and full-sky and high-sensitivity polarization maps may 
shed further light on this issue. 

Subject headings: cosmic microwave background — cosmology: observations — methods: statistical 



1. INTRODUCTION 

While the first-year results from the Wilkinson 
Microwave Anisotro py Probe (WMAP) experiment 
([Bennett et al.ll2003ft overall clearly supported the cur- 
rently popular inflationary cosmological model, describ- 
ing a flat, isotropic and homogeneous universe seeded 
by Gaussian and adiabatic fluctuations, a disturbing 
number of unexpected anomalies on large scales were 
reported shortly after the public data release. Per- 
haps the three most important ones were: 1) align- 
ments and symmetry fe atures among low £ multipoles 
(|de Oliveira-Costa et alj 12004 lEriksen et al.ll2004ab . 2) 
an apparent asymmetry in the distributi on of fluctua- 
tion power in two opposi ng hemispheres ([Eriksen et al.l 
l2004ri [Hansen et al.ll2004T ) , and 3) a peculiar cold spot in 
the s outhern hemisphere (|Vielva et alj 120041 ICruz et al.l 
2005). All of these features were subsequently studied 
extensively by independent groups, and all remain unre- 
solved to the present day. 

In March 2006 the three-year WMAP results were 
released, prompting researchers to revisit the anoma- 
lies detected in th e first-year data (iBridees et alJl2006t 
Copi et al.1 120061: [Jaffe et al.1 120061: lLand k. Magueiiol 
20061: iMart mez-Gonzalez et al.l l2006h . Of course, con- 



sidering that already the first-year data were strongly 
signal-dominated on the scales of interest, it should come 
as no surprise that most of these analyses concluded with 
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similar results as for the previous data, although different 
foreground handling could affect some results. 

The WMAP team paid particular attention to the 
question of large-scale power asymmetry in their analyses 
([Hinshaw et alJl2007b ISpergel et ajJl2007a|). Spe c ifically , 
in an early version of their paper, Spergel et al.l (|2007a[ ) 
approached the problem from a semi-Bayesian point of 
view, by defining a parametric model consisting of an 
isotropic and Gaussian CMB field modulated by a large- 
scale function. The power asymmetry anomaly was then 
addressed by a dipolar modulation field, and the low- 
£ alignment anomalies were studied with a quadrupole 
modulation field. However, due to several issues with 
this early analysis, several of which were first addressed 
by the present paper, the authors decided to remove 
the cor responding section fro m the final version of their 
paper ([Spergel et alj l2007bl ). One example is simple 
marginalization over non-cosmological monop oles and 
dipole s components, which was first done by iGordonl 
(2007) in an otherwise identical analysis. A second ex- 
ample was the limit ed harmonic range considered by 
ISpergel et alJ (|2007af ). Thus, we present in this Letter 
the first complete modulation analysis t hat covers the 
full ran ge of angular scales presented by lEriksen et al.l 
(|2004bf ). and that takes into account all known sources 
of systematics, such as monopole/dipole and foreground 
marginalization. We also present the first proper compu- 
tation of the Bayesian evidence for the modulated model. 

Following the first report of the power asymmetry, 
much effort has been spent by theorists on providing pos- 
sible physical explanations. Examples range from those 
questioning the very fundamentals of physics and cosmol- 
ogy (e.g., introducing intrinsically inhomogeneous cos- 
mologies - Moffat 2005 and Jaffe et al. 2005; violation 
of Lorenz invariance - Kanno & Soda 2006; or violation 
of rotational invariance in the very early universe - Ack- 
ermann, Carroll & Wise 2007) to those essentially con- 
sidering special cases of established physics (e.g., second- 
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order gravitational effects from local inhomogeneities - 
Tomita 2005; the presence of local voids - Inoue & Silk 
2006; spontaneous isotropy breaking from non- linear re- 
sponse to long-wavelength density fluctuations - Gordon 
et al. 2005). 

2. ALGORITHMS 

We now outline the methods used for the analyses pre- 
sented in the following sections. 

2.1. Data model and likelihood 

We model the CMB temperature sky maps as 

d(n)=s(n)[l+/(n)]+n(n), (1) 

where s(n) is a statistically isotropic and Gaussian ran- 
dom field with power spectrum Ci, /(n) is a dipole mod- 
ulation field with amplitude less than unity, and n(n) is 
instrumental noise. Thus, the modulated signal compo- 
nent is an anisotropic, but still Gaussian, random field, 
and therefore has a covariance matrix given by 

S(n, m) = [1 + /(n)]S(n, rh)[l + /(m)], (2) 

where 

S(n,m) = ^-V(2£ + l)C^(n-m). (3) 

e 

Taking into account instrumental noise and possible 
foreground contamination, the full covariance matrix is 

C(n,m) = S(n,rh) + N + F. (4) 

The noise and foreground covariance matrices depend on 
the data processing, and will be described in greater de- 
tail in f}3l With these definitions ready at hand, the 
log-likelihood is given by 

-21og£ = d T CT 1 d + log|C|, (5) 

up to an irrelevant constant. 

2.2. Posterior distributions and choice of parameters 

The posterior distribution P(8\d) is a primary goal of 
any Bayesian analysis, 6 being the set of all free param- 
eters in the model. For the model defined above, the 
free parameters can be divided into two groups, namely 
those describing the isotropic CMB covariance matrix or 
and those describing the modulation field. Both may 
be parametrized in a number of different ways, and these 
choices may affect the outcome of the analysis through 
different prior definitions. 

First, for the isotropic CMB component, we choose 
to parametrize the power spectrum in terms of a simple 
two-parameter model with free amplitude q and tilt n, 

<a -«(£)'<*'■ M 

Here £q is a pivot multipole and is a fiducial model, 
in the fo llowing chosen t o be t he best-fit power law spec- 
trum of Hinsh aw et al.l (|2007f h Second, the modulation 
field f(n) is parametrized in terms of a direction p and 
an overall amplitude A, 



We use flat priors on all parameters in this paper; 
the modulation axis is uniform over the sphere, and the 
amplitude is restricted to A < 0.3. The power spec- 
trum parameters are restricted to 0.5 < q < 1.5 and 
—0.5 < n < 0.5. These choices are sufficiently generous 
to include all non-zero parts of the likelihood. 

The posterior distribution, 

P(q, n, A, p|d) oc £(q, n, A, p)P{q, n, A, p), (8) 

is then mapped out using a standard Markov Chain 
Monte Carlo technique. We use a Gaussian proposal den- 
sity for q, n, and A, and an Euler-matrix based, uniform 
proposal density for p. 

2.3. Bayesian evidence and nested sampling 

In a Bayesian analysis, one is not only interested in the 
set of best-fit parameter values, but also in the relative 
probability of competing models. The most direct way 
of measuring this is through the Bayesian evidence, 

E = P(d\H) = J P(d\6,H)P(0\H)d9, (9) 

which is simply the average likelihood over the prior vol- 
ume. Typically, one computes this quantity for two com- 
peting models, Hq and H±, and considers the difference 
Alog-E = log i?i — log i?o. If Alogi? > 1, the evidence 
for Hi is considered substantial; if Alogi? > 2.5, it is 
considered strong. 

Traditionally, computation of evide nces has been a 
compu tational challenge. However, iMukhcric e et al.l 
(2006) intro duced a, meth od called "nested sampling", 
proposed bv lSkillirigl (|2004D . to the cosmological commu- 
nity that allows for accurate estimation of the evidence 
through Monte Carlo sampling. We implemented this for 
the priors and likelihood described above, and found that 
it works very well for the problem under consideration. 

2.4. Maximum-likelihood analysis 

We also perform a standard frequentist maximum- 
likelihood analysis by computing the maximum- 
likelihood modulation parameters for isotropic Monte 
Carlo simulations. For these computations, we use a 
modified version of the evidence code, which we find to 
be considerably more robust than a simple non-linear 
search; while the non-linear search algorithms often get 
trapped in local minima, the nested sampling algorithm 
always find the correct solution, but of course, at a con- 
siderably higher computational expense. 

3. DATA 

We analyze two versions of the three-year WMAP sky 
maps in the following; the template-corrected V-, and 
W^-band maps, and the "foregro und cleaned" Interna l 
Linear Combination (LLC) map (jHinshaw et al.ll2007f). 
All m aps are processed as described by lEriksen et al.l 
(2006): They are first downgraded to HEALPix 7 res- 
olution Afgjjg = 16, by additional smoothing to a 9° 
FWHM Gaussian beam and appropriate pixel window. 
Second, uniform Gaussian noise of On = 1 A*K is added 
to each pixel in order to regularize the pixel-pixel covari- 
ance matrix. This combination of smoothing and noise 



/(n) =Afip. 



(7) 



7 http://hcalpix.jpl.nasa.gov 
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1LC; 12,8% cut 
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Fig. 1. — Posterior distributions for the dipole modulation am- 
plitude, marginalized over direction and CMB power spectrum. 



TABLE 1 



Data 


( Z bf > fe bf) 


A bf 


Alog£ 


A log,E 


P 


ILC a 


(225°, -27°) 


0.114 


6.1 


1.8 ±0.2 


0.991 


ILC b 


(208°, -27°) 


0.125 


6.0 


1.8 ±0.2 


0.991 


Q-band b 


(222°, -35°) 


0.124 


5.5 


1.5 ±0.2 


0.987 


V-band b 


(205°, -19°) 


0.127 


5.6 


1.5 ±0.2 


0.990 


VK-band b 


(204°, -31°) 


0.121 


5.2 


1.3 ±0.2 


0.985 



Note. — Modulation model results. The listed quantities 
are the marginal best-fit dipole axis (second column) and 
amplitude (third column); the change in likelihood at the 
posterior maximum, A log C = log £ moc j — log £j so , between 
the modulated and the isotropic model (fourth column); the 
Bayesian evidence difference, A log E = log £ moc i — log E- lso 
(fifth column); and the frequentist probability for obtaining 
a lower maximum-likelihood modulation amplitude than the 
observed one, computed from isotropic simulations (sixth col- 
umn). 

a Liberal 12.8% sky cut imposed. b Conservative 36.3% sky 
cut imposed. 

level results in a signal-to-noise ratio of unity at I = 40, 
and strong noise domination at the Nyquist multipole of 
£ = 47. 

We use two different sky cuts for our analyses. First, 
given that the galactic plane is clearly visible in the sin- 
gle frequency data, our first mask is conservatively de- 
fined. This cu t is cre ated by expanding the Kp2 mask 
iHinshaw et al.l (|2007h by 9° in all directions, and then 
manually removing all near-galactic pixels for which any 
difference map between two channels are clearly larger 
than noise. In total, 36.3% of all pixels are rejected by 
this cut (see figure [5]). Second, we also adopt the directly 
downgraded Kp2 cut used by the WMAP team that re- 
moves 12.8% of all pixels. We use this mask for the ILC 
map only. 

The noise covariance matrix is given by the uniform 
noise only, N^j — cr^Sij. For completeness, we have also 
computed the noise covariance from the smoothed instru- 
mental noise for the V-band data, but we find that this 
has no effect on the final results, since its amplitude is 
far below the CMB signal. It is therefore omitted in the 
following. 

As an additional hedge against foreground contamina- 
tion, we marginalize over a set of fixed spatial templates, 
tj, through the covariance matrix Fj = ctititj , on > 10 3 . 




C. 12.8% cut 
Q-band, 36.3% ait 
V-band, 36.3% cut 




0.017 



Fig. 2. — Posterior distribution for the dipole modulation axis, 
shown for the ILC map and 36.3% sky cut, and marginalized over 
power spectrum and amplitude parameters. Grey sky pixels in- 
dicate pixels outside the 2a confidence region. The dots indicate 
the axis 1) reported by Eriksen et al. (2004) in white; 2) for the 
ILC map with a 12.8% sky cut in green; 3) for the Q-, V-, and 
VK -bands in red, blue an d yellow, respectively. The axis reported 
bv lSpergel e t al. (2007a) coincides with the W-band axis. 

Monopole and dipole terms are always included, and one 
or more foregro und templates. For th e V-band and ILC 
maps, we follow IHinshaw et al.l (|2007h and adopt V-TLC 
as our foreground template. Fo r the Q-band data, we 
marginal ize over a synchr otron (Ha slam et al.l Il982f) . a 
free-f ree (jFinkbeinerl(2003j) . and a dust ( Finkb einer et al.l 
1999) template individually. Finally, for the W-band 
data, we use the W-ILC difference map. However, we 
have tried various combinations for all maps, and there 
is virtually no sensitivity to the particular choice, or in- 
deed, to the template at all, due to the conservative sky 
cut used. 



4. RESULTS 

The results from the analysis outlined above are sum- 
marized in Table [TJ For each map, we report the best-fit 
dipole axis and amplitude, as well as the maximum log- 
likclihood difference and Bayesian evidence difference for 
the modulated versus the isotropic model. The errors on 
the evidence are estimated by performing eight indepen- 
dent analy ses for each case, and c omputing the standard 
deviation (|Mukheriee et al.ll2006l ). We also compute the 
probability of obtaining a smaller modulation amplitude 
than the observed one by analyzing 1000 isotropic Monte 
Carlo simulations. 

Starting with the first case in Table [TJ the ILC map 
cut by a 12.8% mask, we see that the best-fit modula- 
tion axis points toward (/,&) = (225°,— 27°), and the 
corresponding modulation amplitude is 0.114. The raw 
likelihood improvement is Alog£ = 6.1. The probabil- 
ity of finding such a high modulation amplitude in in- 
trinsically isotropic simulations is ~ 1%, and, finally, the 
improvement in Bayesian evidence is AlogE 1 = 1.8. 

Further, these results are not sensitive to data set or 
sky coverage: Even the Q-band map, which presum- 
ably is the least reliable with respect to residual fore- 
grounds, yields a modulation amplitude which is high at 
the 98.7% (frequentist) confidence level, and a Bayesian 
log-evidence improvement of 1.5. This frequency inde- 
pendence is further illustrated in Figure[JJ where we show 
the marginalized posterior distributions for the modula- 
tion amplitudes for each data set. The agreement among 
data sets is very good. 

In Figure [2] we show the dipole axis posterior distri- 
bution for the ILC map and 36.3% sky cut. Superim- 
posed on this, we have also marked th e first-y ear asym- 
metry axis reported by lEriksen et alj {2004b) [(/, b) — 



4 



(237°, —10°)] in yellow, and also the other axes listed in 
TableQ] All agree well within 2cr, and this is another tes- 
timony to the excellent stability of the effect with respect 
to statistical method, data set and overall procedure. 

Finally, we note that this model may also partially ex- 
plain the anomalous cold sp ot reported bv IVielva et al.1 
i|2004f) and lCruz et"aT1 (|2005): By demodulation the spot 
would increase its temperature by about 10%, and al- 
though still very cold, it would be significantly less ex- 
treme. Similar arguments could possi bly also be made fo r 
the Bianchi VII?,, correlation found bv lJaffe et alj (2005). 
These issues will be considered further in future work. 

5. CONCLUSIONS 

A notable power asymmetry between two opposing 
hemispher es in the first-year W MAP sky maps was re- 
ported by lEriksen et al.l J2004b) . This feature may be 
observed as strong fluctuations in the southern ecliptic 
hemisphere, but virtually no large-scale structure in the 
northern ecliptic hemisphere (e.g., Hinshaw et al. 2007). 

In this Letter, we have revisited this issue in the three- 
year WMAP data, adopti ng the statistic a l frame work in- 
troduced and applied by ISpergel et al.l (|2007af ) . With 
these tools, we find that the evidence for power asym- 
metry in the WMAP data is very consist ent with that 
initially reported for the first-year maps bv lEriksen et all 
(|2004bf ). and the WMAP data clearly suggest a dipolar 
distribution of power on the sky: The best-fit modulation 
amplitude is roughly 12% in real space, or about 20% in 
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terms of power spectra. The corresponding dipole direc- 
tion is (l,b) ~ (225°, —27°). All results are independent 
of data set choices, i.e., frequency channel or sky cut. 

However, the statistical evidence for this effect is still 
only tentative. In frequentist language, the significance 
is about 99%, while in Bayesian terms, the log-evidence 
difference is ~ 1.5 to 1.8, corresponding to odds of one 
to five or six. This is quite comparable to the evidence 
for ns 7^ 1 after the three-year WMAP data release, for 
which the odds are about one to eight in the highest 
case (Parkinson et al. 2006). Thus, there is still a chance 
that the effect may be a fluke, and most likely, this will 
remain the situation until Planck provides new data in 
some five years. With additional frequency coverage, a 
better job can be done on foreground treatment, and 
more sky coverage can be reliably included in the anal- 
ysis. Second, full-sky and high-sensitivity polarization 
data should provide valuable insights on the origin of 
the effect. 
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